'--put file Flhl4273new; Output File Flhl4273tra 



Sequence length 1743 



Sdapi-rsleqanrtrfpffs 29 
GGC GAC GCG CCC TTG CGC AGC CTG GAG CAA GCC AAC CGC ACC CGC TTT CCC TTC TTC TCC 87 

DVKGDHRLVLAAVETTVCVL 49 
GAC GTC AAG GGC GAC CAC CGG CTG GTG CTG GCC GCG GTG GAG ACA ACC GTG CTG GTG CTC 147 

IFAVSLLGNVCALVLVARRR 69 
ATC TTT GCA GTG TCG CTG CTG GGC AAC GTG TGC GCC CTG GTG CTG GTG GCG CGC CGA CGA 207 

RRGATACLVLNLFCADLLFI 89 

CGC CGC GGC GCG ACT GCC TGC CTG GTA CTC AAC CTC TTC TGC GCG GAC CTG CTC TTC ATC 267 

SA1 PLVLAVRWTEAWLLGPV109 
AGC GCT ATC CCT CTG GTG CTG GCC GTG CGC TGG ACT GAG GCC TGG CTG CTG GGC CCC GTT 327 

ACHLLFYVMTLSGSVTILTL 129 
GCC TGC CAC CTG CTC TTC TAC GTG ATG ACC CTG AGC GGC AGC GTC ACC ATC CTC ACG CTG 387 

AAVSLERMVCIVHLQRGVRG149 
GCC GCG GTC AGC CTG GAG CGC ATG GTG TGC ATC GTG CAC CTG CAG CGC GGC GTG CGG GGT 447 

PGRRARAVLLALIWGY SAVA 169 
CCT GGG CGG CGG GCG CGG GCA GTG CTG CTG GCG CTC ATC TGG GGC TAT TCG GCG GTC GCC 507 

ALPLCVFFRVVPQRLPGADQ189 
GCT CTG CCT CTC TGC GTC TTC TTT CGA GTC GTC CCG CAA CGG CTC CCC GGC GCC GAC CAG 567 

EISICTLIWPTIPGEISWDV 209 
GAA ATT TCG ATT TGC ACA CTG ATT TGG CCC ACC ATT CCT GGA GAG ATC TCG TGG GAT GTC 627 

SFVTLNFLVPGLVIVISYSK229 
TCT TTT GTT ACT TTG AAC TTC TTG GTG CCA GGA CTG GTC ATT GTG ATC AGT TAC TCC AAA 687 

I L Q I TKASRKRLTVSLAYSE 249 
ATT TTA CAG ATC ACA AAG GCA TCA AGG AAG AGG CTC ACG GTA AGC CTG GCC TAC TCG GAG 747 

SHQIRVSQQDFRLFRTLFLL 269 
AGC CAC CAG ATC CGC GTG TCC CAG CAG GAC TTC CGG CTC TTC CGC ACC CTC TTC CTC CTC 807 

MVSFFIMWSPIIITILI.ILI289 
ATG GTC TCC TTC TTC ATC ATG TGG AGC CCC ATC ATC ATC ACC ATC CTC CTC ATC CTG ATC 867 

QN FKQDLVIWPSLFFWVVAF309 
CAG AAC TTC AAG CAA GAC CTG GTC ATC TGG CCG TCC CTC TTC TTC TGG GTG GTG GCC TTC 927 



FIG- 1 



TFANSALNPILYNMTLCRNE 329 
RCA TTT GCT AAT TCA GCC CTA AAC CCC ATC CTC TAC AAC ATG ACA CTG TGC AGG AAT GAG 987 

WKKIFCCFWFPEKGAILTDT 349 
TGG AAG AAA ATT TTT TGC TGC TTC TGG TTC CCA GAA AAG GGA GCC ATT TTA ACA GAC ACA 1047 



SVKRNDLSIISG* 362 
TCT GTC AAA AGA AAT GAC TTG TCG ATT ATT TCT GGC TAA 1086 

TTTTTCTTTATAGCCGAGTTTCT»CACCTGGCGAGCTGTGGCATGCTTTTAAACAGAGTT<^TTTCCAGTACCCTCCA 
TC^GTGCACCCTGCTTTAAGAfiAATGAACCTATGCAAATAGAC^TCC^CAGCGTCGGTAAATTAAGGGGTGATCACCAA 
GTTTCATAATATTTTCCCTTTATAAAAGGATTTGTTGGCCAGGTGCAGTGGTTCATGCCTGTAATCCCAGCAGTTTGGG 



AAAATAAAAAAAAAAATTAGCTGGGAGTGGTGGTGGGCACCTGTAATCCTAGCTACTTGGGAGGCTGAACCAGGAGAAT 



CTCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCGAGATCGTGCCATTGCACTCCAACCAGGGCAACAAGAGTGAAAC 



TCCATCTTAAAAAAAAAAAAAAAAAGATTTGTTATGGGTTCCTTTTAAATGTGAACTTTTTTAGTGTGTTTGTAATATG 



ATCAAATTTAATAAATATTTATTTATGACTGTTCAGCAAAAAAAAAAAAAAAAAAGGGCGG 
RTA01/2057957vl 



Fir- t /'coOT^ 



Sequence Description 



7t3i_l PF00001 7 transmembrane recspsor (rhodopsin 119.9 4.7«-37 
Po.-c«<3 for domains : 

Sequence Domair. seq-f sa<J-c hmc-f broro-t score E— value _ 

7otul 1/1 57 321 1 253 {] 119.9 4.7<s-37 
Xliernaw=nts of top- scarring donaias: 

7tn._l; donwii.r, 1 of 1. from 57 to 321: Score 119-9, E = 4.7e-37 

*-><3NlIjVilv-ilrfckklrfcptnifilNIiA.vJvDL,Lflltlppwalyylv>g 

GN+ ++++++r +-r+r +t ,-+-•■ 1NL. ADLLf + p++ 4-+ 

F11H4273. 57 GNVO^VLVAR-WliaiG^^^ 101 

gBedWpfGsalCklvtalc^rvTiu^aSillLtalSiDRYXAIvrvElryrrr 
e w*i-G-«-4- +++*+4-* + il-t-L+a S++R. -f- iv 1+ -i-r 

Flftl4273, 102 EAHLLGFVACm,r.FYV^LSGSWILTI^VSLgMWCIV-HLQilQVil LIS 

r t ap^r Ak^v-il lvWvlnll lslPplI 5» wvlctveegnot lr.vnvcvCli 
-t-r *v+++l+W ++++-I-1P +f+ v + ++ +- ++ +Ctt 
F1K14273, 143 GPCRRAKA.Vr.rJvLXK<^SAVARI.PLCV5-FRVVPQJU < PG — ADQEISICTt. l!3fi 

df peesta6vstwlrflyvll st IvgFllPl IvilvcYtrlirtlr 

+ p-n. + * ++* Fl+P lvi++ V+ II + + ++++ 

Flhl4273, 197 IWPTIPG EISVTOVSFVTI^JFLVpGLVrVISYSKIIJarTKnsrXr 240 

kaaktll\nrir^FvlCWiFyfivlll«3rlC 

+ + + + +->• + ++ +tl++++v F+-J- W P 1+ + 11 4- 

Flh.14273, 241 lcvsleyseshcirvsqqdf RLFRTlr I.IMVSFFBJWSFlTXTILLIlXQ 290 

. IsiimsstCelervlptEllvtlwLayvNsolNPilY<-« 
— r + + P ++++ + +-»-i-+N9+lNPi-«-Y 

Flhl4273, 291 nFK QD&VXWPSLFFWVVAFTFANSAENP1LY 321 
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a Alpha, Regions - Garnier-Robson 
a Alpha, Regions - Chou-Fasman 

■ Beta, Regions - Gamier-Robson 

■ Beta. Regions - Chou-Fasman 
B Turn, Regions - Garnier-Robson 

■ Turn. Regions - Chou-Fasman 
d Coil, Regions - Garnier-Robson 

DHydrophilicfty Plot - Kyte-Doolittle 



m Alpha, Amphipathic Regions - Eisenberg 
a Beta, Amphipath'tc Regions - Eisenberg 
□ Flexible Regions - Karptus-Schulz 



a Antigenic Index - Jameson-Wolf 



a Surface Probability Plot - Emini 
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Analysis of Flhl4273, (362 aa) 




J J | I , t J I , I J , J I J 1 j I , , j i , . J I | < , I J I | « , « | 

1 41 81 121 161 201 241 281 321 361 



»Flhl4:273 . 1086 baaas , 1825 checksum . 

I^PSCARAACTDA-PI/RSIjEQ ANHTRF PFFSD VKC3DHR LVLAAVETTVLVL ZPAVSXiJuQ NV C 

^RLKSADQEISICTLIWOTIFQjSXSWWS^^ 
l|5^SLAYSESHQZRVSQQDFRLiFRTkFIXM^ 

Si^F»WVJiFTFWfSAIil^2LVNfcrrLCRNEWKKIFCCFWF?EKGAIDTl 
G* 



Prosite Pattern Matches for Flhl4273, 



> £Sflflfl01 1 FDOCGC 301 I ASH GLYCOSYLATICN Jv-glycosylaiion site. 



Query: 21 NRTR 24 

Ouery: 322 NMTL 325 

>PS00004 I PDOCOOOOd ICAMP, ,PHOSPKQ_SITS cAMP- and cGMP - dep<z nden t protein kinase phosphorylat 
Query: 239 KRI/T 242 

> PS00005 I PPOC00005 I PKC_PHOSPKO_SXTS Protein kinase C phosphorylation sise. 

Query: 237 SRK 239 

Query: 330 SVK 352 

> Psn9006 lPDOC00006ICK2 peos?HO_S1TE Casein kinase II phosphorylation site- 
Query: 2S6 SQQD 2S9 

s -PSOOnOfi I PDOCOOOQ8 IMYRISTYI, sr-norristoylation sit«. 

Query: 57 GNVCAL 62 

Query: 72 GATACT, 77 

Query: 343 GAXLTD 348 

>PS00009 I PDOCOOD03 IftMIDATION Araidation situ. 
Query: ISO PGKR Z53 

>PS00Q29 I PDOC00023 ir,EUCIN3_ZIPPEH Leucine zipper pattern. 
Query: 10€ LGPVACHLLFYVMTIjSGSVTIL 127 
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Transmembrane Segments Predicted 03 MEMS AT 



Start 


End 


Orient | 


Score 
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out— >ins | 
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ias—XfUt j 
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134 


out— >ins | 


4.0 


156 


177 


ins— >outj 


4.9 


209 


227 


out— >ins | 


3.7 


266 ||2S9 


;ins-^>out| 


6.5 


297 [321 |out->ins||3.2 



l^FLhl4273. 
WfiPEC&KAAGDAFLRSLEQAmTRFFFFSaVKGDHRLV^ 

' FQBiPGADQEISICTLIWPTIPGBISWDVSFVTL.NFLVPGL.vrVXSYSKILQITKASRKR 
LTVSIAYSES:^IRVS(^DFFJC.FRTLFLI^SFFIMWSPIXITILLII,IQiI^QDLVlWP 
C£LFFWVVAFTFANSAI^PII.Y1^I^HS^K^^ 

Transmembrane segments for presumed mature peptide 



Start 


End 


Orient 


Score 




[37 


ins— >out' 


4.0 




I 73 


out— >ins | 


4.0 


9S 
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ins— >out 
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1166 


out— >ins 
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|228 


ins- SKJitt 


6.5 
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|26D 


out— >ins 
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>F2hl<273 ._jnature 

LVJL.VARRRimGATACXVIJSri^C^^ 

GSVTXlVri^AV9;LFJ3iWCIVHLQRGVRGPGi^^ 

TVSLA^SESKQIKVSQQDFRLFRTLFLI>IVSF?IMWSPIIITTLL1LIQNFKQDLVIWPS 
I<FFWVVAFTFANSAll^PILyi^TI.CRWEWKKIFCCFWFPEKGAIIjTDTSVKBNDLSlISG 

F»G-6 



Input file 14273m; Output File 14273mtra 

Sequence length 1560 ; - ; ,-■ ' 

TTGCCAAGCTCAGCGTAAGCCTCTTCCACTGCAA^ " 
CTCCAGACTTGTCCGGCTTTACCCGAATCTT^ 

MSPECAQTTG 10 

CGCCATCTTCCCGGA03CGTGGGCCGGGCGCCCGGC ATG TCCfCCT GAG TGT GCA CAG ACG ACG GGC 30 

P G P S H T L D Q V - N R ■ T H F P = F " F . S D 30 

CCT GGT CCC TCG CAC ACC CTG GAC CAA GTC AAT CGC ACC CAC TTC CCT TTC TTC TCG GAT 90 

VKGDHRL.VLSVVETTVLGLI 50 

= .GTC AAG GGC GAC CAC CGG TTG GTG TTG AGC GTC GTG GAG ACC ACC GTT CTG GGA CTC ATC 150 

QFVVSLLGNVCALVLVARRRR 70 

C TTT GTC GTC TCA CTG CTG GGC AAC GTG TGT GCT CTA GTG CTG GTG GCG CGC CGT CGG CGC 210 

R G A S A S I* V L N L F C A D L L F T S 90 

CGT GGG GCG TCA GCC AGC CTG GTG CTC AAC CTC TTC TGC GCG GAT TTG CTC TTC ACC AGC 270 

J A I PLV LVVRWTEAWL LGPVV 110 

fSGCC ATC CCT CTA GTG CTC GTC GTG CGC TGG ACT GAG GCC TGG CTG TTG GGG CCC GTC GTC 330 

-CHLLFYVMTMSGSVTILTLA 130 

./ TGC CAC CTG CTC TTC TAC GTG ATG ACA ATG AGC GGC AGC GTC ACG ATC CTC ACA CTG GCC 390 

= " A V S L ERMVC IVRLRRGLSG P 150 

: GCG GTC AGC CTG GAG CGC ATG GTG TGC ATC GTG CGC CTC CGG CGC GGC TTG AGC GGC CCG 450 

Q G R R T Q A A L L A F I W G Y S A L A A 170 

n GGG CGG CGG ACT CAG GCG GCA CTG CTG GCT TTC ATA TGG GGT TAC TCG GCG CTC GCC GCG 510 

LPLYILFRVVPQRLPGGDQE 190 

CTG CCC CTC TAC ATC TTG TTC CGC GTG GTC CCG CAG CGC CTT CCC GGC GGG GAC CAG GAA 570 

IPICTLDWPNRIGEISWDVF 210 

ATT CCG ATT TGC ACA TTG GAT TGG CCC AAC CGC ATA GGA GAA ATC TCA TGG GAT GTG TTT 630 

FETLNFLVPGLVIVI SYSKI 230 

TTT GAG ACT TTG AAC TTC CTG GTG CCG GGA CTG GTC ATT GTG ATC AGT TAC TCC AAA ATT 690 

LQITKASRKRLTLSLAYSES 250 

TTA CAG ATC ACG AAA GCA TCG CGG AAG AGG CTT ACG CTG AGC TTG GCA TAC TCT GAG AGC 750 

HQIRVSQQDYRLFRTLFLLM 270 

CAC CAG ATC CGA GTG TCC CAA CAA GAC TAC CGA CTC TTC CGC ACG CTC TTC CTG CTC ATG 810 

VSFFIMWSPIIITILLILIQ 290 

GTT TCC TTC TTC ATC ATG TGG AGT CCC ATC ATC ATC ACC ATC CTC CTC ATC TTG ATC CAA 870 

NFRQDLVIWPSLFFWVVAFT 310 

AAC TTC CGG CAG GAC CTG GTC ATC TGG CCA TCC CTT TTC TTC TGG GTG GTG GCC TTC ACG 930 

FANSALNP ILYNMSLFRNEW 330 

TTT GCC AAC TCT GCC CTA AAC CCC ATA CTG TAC AAC ATG TCG CTG TTC AGG AAC GAA TGG 990 

RKIFCCFFFPEKGAIFTDTS 350 

AGG AAG ATT TTT TGC TGC TTC TTT TTT CCA GAG AAG GGA GCC ATT TTT ACA GAT ACG TCT 10 5 C 

VRRNDLSVISS* 362 

GTC AGG CGA AAT GAC TTG TCT GTT ATT TCC AGC TAA 10 Sc 

CTAGCCTCTGGTGCCAGGTGAACCACGGTGTGCATGTAAAGGGAGTTAACTTCAAGGAAAGCCCACCAGTGC3CCCTGC 



TTTAAAAATACCCGACTTCCAACAGCAGGCATCTACGGAGCCAGCAAATTAAGGAATGATCGCTCAGTATAAAAATATT 

TTTCCTTAAAAGAACTTTCTATGGGTTCCTTTTGTGAACTTTTT^ 

TTTATTTATAACGTGTTCCTACAAAAAAAAAAAAAAAAAAAAA 



7- Cowr 



Query: 14273m, 

Scores for sequence family classification (score includes all doroaisis } ; 
Sequence Description Score S-walue 



pgl ?. nnnl 7 transmembrane roctsptor (rhodopsin 



Parsed, for domains: 

aeq-£ seq-i 



7tm_l 



1/1 



57 



321 



1 259 [1 



score E— value 
118.8 X«-36 



Alignments of top-scoring domains: 

iIia_Xr doraain. 1 o£ 1, from 57 to 321r scor« 118-8, E - X«s-36 
T = i *-^<^lLVil^lrtkklrtptriifilNLAvADLLflltlppwalyylvg 
"^1 GNT+ +++*++r *~t~t-x ++ f+lKL ADIXC+ + p++ ++ +4- 

SI 14273m- 57 GNVT!/ULVI«VAR-iUUEaa=ASASL 101 

geedWpfGsalCkl^Jil^vviCTyaSillLtalSiDRYlAIvi^lryrrr 
e W++G+-r-rG+l+ +4-4-++++ + il+L+a S++R + XV lr + 
pj 14273m, 102 --EAWLWFWCHLLPYVm'KSGSVrc^ 1-13 

rfcsprrA . JcwiXXvWvlal 1 lslPpllf Swvkt veegjijtlhvnvtvc: 
'tfl rr+i+++t+++W -4-+1++1P ++++ v + ++ g ++ +c+ 

III 14273m, 14S GP-GRRTqAALIAF IWGYSALAXLPLYILFKWPQRLPGGD — QEIPICT 195 

| 3 5 idfpcescasvstwlrsyvlXstlvgFllPllvilvoTtrllrtlr. . . . 
If --d+p-4- + ++s+ +++ ++ Fi+? lv±++ Y+ 11 + + +4-4. 

y 14273m, 156 LDWEURIG ElSWDVFFETIJslFI,VPGL.vrviSYSKIlQI-rKasrk 23 S 

-T^aktllvuxrvvFvlcWlPyfivllldtl 

■*■■*■•*■ - — ■>■•** + ++— + ++ +tl++++v F++ W P i++ll + 
14273m, 240 rltlalayseshqirvsqqcyKLFaTIJLLMVSFFl^SPIIITrLT-11,1 289 

c . lsiircsEtCelervlpta.llvtlwLayvKsclNPiIY<:-* 
++ * + p ++++ + ++++Ns-rlN?i+Y 

14273m, 290 <2«FS QDLVTVfPS LFFWVVATTFANS ALN P I L>Y 321 



25 50 75 



100 125 150 175 200 225 250 275 300 325 350 
} . 




H Alpha, Regions - Garnier-Robson 
H Alpha, Regions - Chou-Fasman 
a Beta, Regions - Garnier-Robson 

■ Beta, Regions - Chou-Fasman 
a Turn, Regions - Garnier-Robson 

■ Turn, Regions - Chou-Fasman 
□ Coil, Regions - Garnier-Robson 

DHydrophific'rty Plot - Kyte-Doolittle 



a Alpha, Amphipathie Regions - Eisenberg 
a Beta, Amphipathie Regions - Eisenberg 
a Flexible Regions - Karplus-Schulz 



Q Antigenic Index - Jameson-Wolf 



Q Surface Probability Plot - Emini 
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Analysis of 14273m, (362 aa) 



PF08001 




, • j , | i | i , i j i , i j « j • | i | « j « | i | . | i J i | i | t j 
1 41 81 121 161 201 241 281 321 361 



>14273ra., 108S bases, 6943 checksum. 
MSPECAQTTGPC?SHTLIX3VNRTHFPFFSD\^OT 

AiVuVARRRRRGAS ASLVLNLr CADLLFTSAI ?r.VLVVRWTEAWIJ^GPVVCH LtiFYVMTM 
SGSVTILTI^VSL2RMVCrVRI^GLSGPGKR<^ 

PQKLPGGl^ElPIC?LDWP^IGEIS?roVF?OTr^IiVPGLVIVISYSKXIX3IT:<ASHK^ 

X,!TI^**SESHQrRVSQQDYKDFRTLFL^ 

SLFFWVAFTFANSA^NPILYNMSLFP^ 

s- 



H6r /O 



Prosite Pattern Matches for 14273m, 



3 -PS00Q.il I FPCCO3O01 : A5M_GLYCOSrJLATZOW JJ-grlycosylacion site. 



Query: 21 NRTH 24 

iiaiy: 322 HMSL 32S 

jpaooooz ipd6c00002 I ciiYC»SAMlNOGLYCAM CJyeosamijioglycari attachment site. 

HD Additional rules: 

lit HO There must be at least two acidic amino acids (Glu or Asp) from -2 to 

•.[;; RXJ -4 relative to the serine. 

Query: 143 SGPG 1S1 

UpS00Q54 I PDOC00OO4 1 CAMP PHOSPHO SITE cAMP- a D a cGKP-d^per.dcnt protein Kinase phosphorylation 

w 

paery: 23.S XJO/T 242 

>?5P0QOS i PDOCQ0005 I ?xc_?ttOSPHO_SITE Protein kinase C phosphorylation sits. 

Query: 237 S3X 239 

Query: 350 SVR 352 

>?g000Q6 j PDQC0Q006 1 CK2_?HOSPHO_SITB Casein kinase II phosphorylation sit.a. 

Query: 40 SWE 43 

QiMtryt 256 SQQD 259 

>PSQ00Ce iPlaQC0O008fMY^TSTYT N-myristoylation site. 

Query: 57 GKVCAL 62 

Que=y: 72 GAS AS L 77 
Query.- 343 GAIFTD 34e 

> PS0OO09 i PDOCOQCCS (AMTOAttom A.T.idation site. 
Q-ory: ISO PGRR 153 



Transmembrane Segments JPredicted by MLfciViSA X 
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ins— >out 


33 " 


I 113 


134 
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3.8 
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i ins— >out | 
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out— >ins (j 2.4 
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inS— >out 1 6.5 


fcLj 


321 J out~>ins jj 3.2 



Jvl4273m, 

Transmembrane segments for presumed mature peptide 




>14273ni, ^nvature 



